Estimate-based goodness-of-fit test for large sparse multinomial distributions

نویسندگان

  • Sung-Ho Kim
  • Hyemi Choi
  • Sangjin Lee
چکیده

The Pearson’s chi-squared statistic (X2) does not in general follow a chi-square distribution when it is used for goodness-of-fit testing for a multinomial distribution based on sparse contingency table data. We explore properties of Zelterman’s (1987) D2 statistic and compare them with those of X2 and we also compare these two statistics and the statistic (Lr) which is proposed by Maydeu-Olivares and Joe (2005) in the context of power of the goodness-of-fit testing when the given contingency table is very sparse. We show that the variance of D2 is not larger than the variance of X2 under null hypotheses where all the cell probabilities are positive, that the distribution of D2 becomes more skewed as the multinomial distribution becomes more asymmetric and sparse, and that, as for the Lr statistic, the power of the goodness-of-fit testing depends on the models which are selected for the testing. A simulation experiment strongly recommends to use both D2 and Lr for goodness-of-fit testing with large sparse contingency table data.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Goodness of Fit Test For Exponentiality Based on Lin-Wong Information

In this paper, we introduce a goodness of fit test for expo- nentiality based on Lin-Wong divergence measure. In order to estimate the divergence, we use a method similar to Vasicek’s method for estimat- ing the Shannon entropy. The critical values and the powers of the test are computed by Monte Carlo simulation. It is shown that the proposed test are competitive with other tests of exponentia...

متن کامل

Disparity Based Goodness-of-fit Tests for and against Order Restrictions for Multinomial Models

We consider disparity based test statistics to test the equality of a multinomial probability vector to a given probability vector against an isotonic=antitonic order restriction. The problem of testing the isotonic restriction as a null hypothesis against unrestricted alternatives is also considered. In both cases the asymptotic distributions of these test statistics are found to be of the chi...

متن کامل

Package 'emt' Title Exact Multinomial Test: Goodness-of-fit Test for Discrete Multivariate Data

February 15, 2013 Type Package Title Exact Multinomial Test: Goodness-of-Fit Test for Discrete Multivariate data Version 1.1 Date 2013-01-27 Author Uwe Menzel Maintainer Uwe Menzel Description The package provides functions to carry out a Goodness-of-fit test for discrete multivariate data. It is tested if a given observation is likely to have occurred under the assumption...

متن کامل

A New Class of Private Chi-Square Tests

In this paper, we develop new test statistics for private hypothesis testing. These statistics are designed specifically so that their asymptotic distributions, after accounting for noise added for privacy concerns, match the asymptotics of the classical (nonprivate) chi-square tests for testing if the multinomial data parameters lie in lower dimensional manifolds (examples include goodness of ...

متن کامل

Geometry of Goodness-of-Fit Testing in High Dimensional Low Sample Size Modelling

We introduce a new approach to goodness-of-fit testing in the high dimensional, sparse extended multinomial context. The paper takes a computational information geometric approach, extending classical higher order asymptotic theory. We show why the Wald – equivalently, the Pearson χ and score statistics – are unworkable in this context, but that the deviance has a simple, accurate and tractable...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computational Statistics & Data Analysis

دوره 53  شماره 

صفحات  -

تاریخ انتشار 2009